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CHAPTER  1 


Introduction 

The  increase  in  the  complexity  of  circuits  fabricated  on  a  single  semiconductor  chip  has 
tnnHi-  the  design  and  testing  of  the  circuits  more  difficult.  For  a  number  of  years,  circuits 
with  only  a  small  number  of  transistors  have  been  successfully  analyzed  using  circuit 
simulators  such  as  SPICE2[1].  However,  as  the  size  of  the  circuit  grows,  the  execution  time 
and  memory  requirements  of  these  simulators  become  prohibitive.  These  problems  have  led 
to  the  development  of  new  simulators  that  are  much  faster  and  less  memory  intensive  than 
SPICE-type  simulators  such  as  MOTIS[2],  MOTIS*C[3],  and  M OSTAP [4], 

The  new  simulators  developed  for  analyzing  LSI  and  VLSI  circuits  often  restrict  the  type  * 
of  circuits  that  they  can  deal  with.  This  knowledge  is  then  utilized  in  a  variety  of  ways  to 
quickly  produce  relatively  accurate  time  domain  waveforms  for  the  drcuits.  For  digital 
circuits,  this  means  that  the  process  of  design  verification  can  be  carried  out  on  a  level 
between  that  of  conventional  circuit  analysis  and  conventional  logic  simulation. 

One  of  the  most  common  techniques  used  in  large  scale  circuit  simulators  is  the 
decomposition  of  die  circuit  into  smaller  subcircuits.  Each  subcircuit  is  then  analyzed 
individually  and  the  results  are  combined  to  give  the  desired  data.  In  the  genera]  case,  this  is 
known  as  "tearing"  and  corresponds  to  partitioning  the  network  equations  in  particular 
way[5].  If  the  subcircuits  can  be  considered  unidirectional  (as  is  often  the  case  for  MOS 
circuits),  then  further  gains  in  speed  can  be  achieved  by  analysis  sequencing  and  latency 
checking[6J. 

A  second  approach  can  be  considered  as  temporal  decomposition  of  the  circuit.  In  this 
method,  known  as  waveform  relaxation,  the  solution  for  an  entire  time  interval  is 
approximated  using  only  a  limited  number  of  iterations  at  any  point  within  the  interval.  This 


sweep  of  the  whole  interval  can  then  be  repeated  until  the  solutions  converge[7].  However, 
if  it  can  be  justified  that  the  first  sweep  is  accurate  enough,  it  is  possible  to  discard  the 
following  sweeps  as  well  as  the  overhead  associated  with  fully  implementing  waveform 
relaxation. 

In  timing  simulators  such  as  MOTIS-C,  and  SPLICE[8],  a  Gauss-Seidel-like  technique  is 
used  to  decouple  the  network  equations  and  a  single  sweep  of  die  relaxation  technique  is 
taken.  This  approach  of  not  iterating  to  convergence  means  that  the  classical  numerical 
properties  of  an  algorithm  such  as  stability  and  convergence  may  not  hold.  Hence,  new 
studies  of  these  properties  become  necessary  to  provide  a  basis  for  the  use  of  an 
algorithm[9]. 

Wei  proved  in  Reference  [10]  that  the  standard  Gauss-Seidel  method  is  not  convergent 
when  floating  capacitors  exist  in  the  circuit.  This  helped  motivate  die  development  of  a 
modified  Gauss-Seidel  method  which  used  a  forward  predictor  to  estimate  the  values  of 
unsolved  variables  in  feedback  loops.  The  predictor  Gauss-Seidel  method,  shown  to  be  zero 
stable  and  convergent,  was  implemented  in  a  program  called  PREMOS  designed  for 
simulating  NMOS  circuits. 

The  work  described  in  this  thesis  covers  two  basic  areas.  The  first  is  a  further  study  of  the 
numerical  properties  of  the  standard  Gauss-Seidel  and  the  predictor  Gauss-Seidel  methods. 
A  modification  to  the  predictor  method  is  made  that  gives  it  certain  properties  of  both 
methods.  The  second  area  is  concerned  with  adding  the  capability  of  analyzing  some  CMOS 
circuits  to  the  PREMOS  program. 

Chapter  2  is  a  short  description  of  some  basic  techniques  used  in  numerical  circuit 
analysis.  The  properties  of  the  Gauss-Seidel,  predictor  Gauss-Seidel,  and  modified  predictor 
Gauss-Seidel  are  studied  in  Chapter  3,  while  Chapter  4  describes  the  CMOS  circuits  that  have 
been  implemented  in  PREMOS.  Finally  Chapter  5  presents  some  conclusions  and  comments. 


CHAPTER  2 


Circuit  Simulation  Techniques 


2.1  Introduction 

Conventional  circuit  simulators  have  proven  to  be  very  successful  at  providing  accurate 
current  and  voltage  waveforms  for  small  scale  integrated  circuits.  However,  in  LSI  and  VLSI 
design,  these  exact  waveforms  are  often  not  needed  as  much  as  just  the  timing  of  the 
transitions  from  one  logic  state  to  another.  This  led  to  the  development  of  logic  simulators[8] 
that  give  results  in  terms  of  discrete  levels.  These  logic  simulators  are  much  faster  than 
circuit  simulators,  but  the  loss  of  information  is  not  always  acceptable.  In  an  attempt  to 
bridge  this  gap  between  the  accuracy  of  circuit  simulators  and  the  speed  of  logic  simulators, 
timing  simulators  are  being  developed  with  the  goal  of  providing  waveforms  close  to  those  of 
circuit  simulators  at  speeds  approaching  those  of  logic  simulators. 

Most  timing  simulators  can  be  broken  into  two  categories:  those  based  on  conventional 
circuit  analysis  that  employ  various  techniques  to  increase  the  speed,  and  those  based  on  logic 
simulation  but  employ  various  techniques  to  increase  the  accuracy.  This  report  is  concerned 
with  techniques  more  closely  related  to  circuit  analysis  than  to  logic  simulation.  In  this 
respect,  some  of  the  existing  circuit  analysis  techniques  are  reviewed  in  this  chapter. 

2.2  Conventional  Clrtnlt  Apalrrli 

A  nonlinear  dynamic  circuit  may  in  general  be  characterized  by  the  equation, 

f(x,x,t)  -  0,  x(0)  -  x.  (2.1) 

where  x  represents  a  vector  of  voltages  and/or  currents  and  t  represents  time.  In  order  to 
obtain  a  numerical  solution  to  x,  Equation  (2.1)  is  discretized  at  each  time  point  tB  by  using 


an  integration  formula  such  as  the  backward  Euler,  trapezoidal,  or  one  of  Gear's  formulas. 
This  discretization  transforms  Equation  (2.1)  at  each  time  point  to  an  algebraic  equation  of 
the  form, 

g(xj  =  0  (2.2) 

Equation  (2.2)  is  then  usually  solved  by  a  modified  Newton’s  method  which  repetitively 
develops  and  solves  linear  equations  of  the  form, 

A  x  -  b  (2.3) 

where  A  is  a  matrix  and  b  is  a  vector.  At  every  iteration  A  and  b  must  be  constructed  by 
linearizing  the  nonlinear  equation  at  a  new  iteration  point  found  during  the  previous 
iteration.  This  process  is  repeated  until  the  sequence  of  x  converges  to  within  some  specified 
tolerance.  Once  the  solution  for  x.  at  a  this  point  is  found,  the  time  is  incremented  and  the 
process  is  started  again  until  the  final  time  is  reached.  The  basic  algorithm  can  be  given  as 

BEGIN 

BEGIN 

X  =  [Voltages,  Currents] 

TIME  =  Start  Time 
H  =  Initial  Time  step 
END  {initialization} 

TIME  -  TIME  +  H 
WHILE  (TIME  <  End  Time)  DO 
BEGIN 

Discretize  the  differential  operators  by 
using  an  integration  formula. 

REPEAT 
BEGIN  k  -  1 

Evaluate  linear  models  for  circuit  elements  at 
the  operating  points  and  form  the  circuit 
matrix  A  and  vector  b. 

Solve  linear  equations  AX  *  b. 

END 
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UNTIL  (convergence  achieved)  (dc  loop} 
IF  the  tool  truncation  error  (LTE)  is 
smaller  than  the  tolerance 
THEN 
BEGIN 

Compute  new  time  step  H 
TIME  «  TIME  +  H 
END 
ELSE 
BEGIN 

TIME  -  TIME-H 
Compute  revised  time  step  H 
TIME  *  TIME  +  H 
END 

END  {time  loop} 

END 


An  increase  in  speed  can  be  achieved  by  using  sparse  matrix  techniques.  However,  these 
techniques  are  not  cost  effective  for  VLSI  simulations;  as  a  result,  new  approaches  have 
recently  been  proposed.  The  basis  for  several  of  these  new  methods  will  be  described  in  the 
next  sections. 


2.3  Ltreo  Seal*  Circuit  Analysis 


A  number  of  techniques  based  on  relaxation  methods  have  been  proposed  for  solving 
large  systems  of  simultaneous  equations.  When  these  techniques  are  applied  to  circuits,  a 
priori  knowledge  of  some  of  the  properties  of  the  circuits  can  be  used  to  significantly 
decrease  the  amount  of  computation  required  while  at  the  same  time  maintaining  accuracy. 
However,  it  is  important  to  first  understand  the  basic  methods. 


2.4  Point  Ga«m»  Jacob!  Algorithm 


If  the  vector  x  in  Equation  (2.2)  satisfies  x  c  R"  and  xf  is  the  value  of  the  kth  component 
of  the  vector  x  at  time  t,,  the  value  of  x£*'  is  found  by  solving  the  scalar  equation, 

g(xf ,Xf ,  .  .  .  X* j,Xk,XkB+  . x“)  (2.4) 


The  complete  vector  x*" 1  is  found  by  incrementing  k  from  1  to  m. 


While  this  algorithm  is  fairly  easy  to  implement,  it  may  converge  to  the  solution  very 
slowly,  provided  that  it  converges  at  all.  A  natural  extension  of  this  algorithm  leads  to  the 


next  algorithm. 


2.5  Point  Gaim-Seldel  Algorithm 


In  this  algorithm,  information  from  the  present  iteration  is  used  as  well  as  information 
from  the  previous  iteration  which  usually  decreases  the  number  of  iterations  necessary  for 
convergence.  When  computing  component  xj*1,  the  equation  to  be  solved  is 


g^r1 


-n+  1 
x2  . 


.*£-1.  Xk,  x£+1, 


x£) 


(2.5) 


This  algorithm  can  result  in  a  considerable  increase  in  speed,  especially  when  the  circuit 
variables  are  evaluated  in  the  same  order  as  the  signal  flow  through  the  circuit. 


2.6  Block  Algorithms 

If  each  xk  represents  a  node  voltage,  then  from  the  network  point  of  view,  the  two 
methods  above  are  equivalent  to  decomposing  the  network  at  every  node.  Another 
possibility  is  to  decompose  the  circuit  into  subcircuits  composed  of  several  nodes.  When  the 
Gauss-Jacobi  or  Gauss-Seidel  methods  are  applied  in  conjunction  with  decomposition  into 
subcircuits,  Equations  (2.4)  and  (2.5)  can  still  be  used,  but  with  each  xk  representing  a  vector 
instead  of  a  scalar.  In  this  case  each  vector  xk  can  be  solved  using  the  conventional  circuit 
analysis  previously  described. 


2.7  Waveform  Belruatlon 


For  all  of  the  algorithms  described  so  far,  the  analysis  is  carried  out  at  each  time  point  for 
the  entire  circuit  before  proceeding  to  the  next  time  point.  However,  it  is  also  possible  to 
solve  for  the  waveform  of  each  subcircuit  over  the  entire  time  interval  before  proceeding  to 
the  next  subcircuit.  After  finding  the  waveforms  for  all  of  the  subcircuits,  the  process  can  be 
repeated  until  the  waveforms  converge  to  a  tolution[7]. 


CHAPTER  3 


Numerical  Properties 

3.1  Introduction 

The  techniques  of  the  previous  chapter  can  all  require  a  considerable  amount  of  time  to 
check  to  see  if  the  computed  solution  is  accurate  enough.  At  each  time  point  die  algorithm 
will  have  to  check  for  convergence  and  then  decide  whether  another  iteration  is  needed. 
Then  after  convergence  for  that  time  point  is  achieved,  the  local  truncation  error  (LTE)  must 
be  computed  so  that  a  decision  can  be  made  as  to  the  accuracy  of  the  point  in  question.  If 
the  LTE  is  too  large,  the  point  may  be  thrown  out  and  a  new,  smaller  step  must  be  taken 
before  repeating  the  process.  Even  if  die  point  is  accepted,  the  next  step  size  is  usually 
recomputed  before  proceeding.  The  end  result  is  that  a  lot  of  overhead  computation  not 
directly  related  to  the  desired  solution  b  performed. 

This  has  led  to  the  development  of  some  simulators  that  used  a  fixed  number  of  iterations 
at  each  time  point  as  well  as  a  fixed  step  size.  In  fact  some  of  these  simulators  such  as 
MOTTS,  use  only  a  single  iteration.  While  this  approach  can  obviously  save  a  lot  of 
computation  time  on  a  per  sweep  basis,  the  fact  that  MOTTS  only  takes  one  sweep  means  that 
the  relaxation  b  not  carried  to  convergence  and  classical  numerical  properties  such  as  stability 
and  accuracy  may  not  hold.  Therefore,  these  properties  must  be  evaluated  to  provide  a 
proper  basb  for  the  use  of  this  method. 

3.2  Stability 

In  general,  the  analysb  of  a  particular  method  b  studied  on  a  test  problem  simple  enough 
to  be  analyzed  theoretically,  yet  complex  enough  to  provide  information  about  a  wide  variety 
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of  applications.  The  most  common  test  problem  used  is  probably  the  single  time  invariant 
linear  differential  equation: 

i »  ax,  x(0)  -  x,  (3.1) 

where  "x"  and  "a"  are  scalars.  Using  a  single  equation  has  worked  well  when  an  integration 
method  like  the  backward  Euler,  trapezoidal,  or  one  of  Gear’s  formulas  has  been  carried  to 
convergence  at  each  step.  However  when  a  Gauss -Seidel  type  technique  is  used  and  the 
process  is  not  iterated  to  convergence  at  each  step,  the  values  of  the  variables  behave 
differently  according  to  the  order  in  which  they  are  processed.  Consequently,  the  test 
problem  must  be  generalized  to  become  a  system  of  equations, 

Cx-Gx,  x(0)«x.  (3.2) 

where  x  is  a  vector,  C  and  G  are  matrices,  and  C  is  invertible.  When  an  integration 
algorithm  is  applied  to  Equation  (3.2)  it  is  possible  to  express  the  value  of  x  at  tB.1 
recursively  in  the  form, 

x,*  i  m  M(h)  x.  (3.3) 

where  h  is  the  step  size  and  M(h)  is  known  as  the  companion  matrix.  In  terms  of  the  initial 
conditions,  Equation  (3.3)  becomes 

*..1  -  [M(h)ri  x,  (3.4) 

The  numerical  properties  of  one  step  integration  algorithms  are  then  defined  in  terms  of  the 
properties  of  the  companion  matrix. 

Definition  3.iri0l 

An  integration  algorithm  is  consistent  if  when  applied  to  Equation  (3.2),  its  companion 
matrix  can  be  expanded  in  a  power  series  as  a  function  of  the  step  size  b  as 


M(h)  *  I  +  hA  +  0(hJ) 


(3.5) 


where  A  equal  C_1G. 


Theorem  3.1 


For  a  given  step  size  h*.  the  sequence  of  vectors  {xj  in  Equation  (3.4)  is  bounded  if  and 
only  if  the  spectrum  (or  set  of  eigenvalues)  of  M(h*)  is  contained  in  the  unit  ball  B(0,1)  and 
no  multiple  zero  of  the  minimal  polynomial  has  modulus  equal  to  one[ll].  (The  requirement 
that  no  multiple  zero  of  the  minimal  polynomial  has  modulus  equal  to  one  is  the  same  as 
saying  that  all  the  eigenvalues  with  a  magnitude  of  one  must  be  distinct.) 

The  above  theorem  leads  to  the  following  definition  of  stability  for  an  integration 
algorithm: 

Definition  3.2 

An  integration  algorithm  is  zero  stable  if  and  only  if  there  exists  a  8  >  0  such  that  for  all 
h  in  the  interval  (0,  6),  the  spectrum  of  M(h)  is  contained  in  die  unit  ball  B(0,1),  and  no 
multiple  zero  of  the  minimal  polynomial  has  modulus  equal  to  one. 

Note  that  if  a  circuit  being  evaluated  is  stable,  then  the  solution  is  bounded,  and  die 
integration  algorithm  must  also  be  stable  if  an  accurate  solution  is  desired. 

Definition  3.3 

Let  x(t)  be  the  exact  solution  of  the  test  problem.  An  integration  algorithm  is  convergent 
if  the  sequence  of  the  computed  solution  converges  uniformly  to  x(t)  as  the  step  size  tends  to 


Theorem  3.2 


If  u  integration  algorithm  is  consistent  and  zero  stable,  then  It  is  convergent. 

The  proof  of  Theorem  3.2  can  be  found  in  numerical  books  such  as  Reference  [12]. 

It  has  been  proven  in  Reference  [10]  that  the  Gauss-Seidel  method  is  convergent  if  there 
are  no  floating  capacitors  (i.e.,  capacitors  connected  between  two  nonground  nodes)  present 
in  the  circuit.  However,  it  has  also  been  proven  in  Reference  [10]  that  the  Gauss-Seidel 
method  is  not  consistent,  and  therefore  may  not  be  convergent,  whenever  floating  capacitors 
are  present.  This  led  Wei  to  introduce  a  modified  Gauss-Seidel  method  which  is  convergent 
even  when  floating  capacitors  are  present  in  the  circuit.  However,  from  the  definition, 
convergence  is  dependent  upon  the  step  size  tending  to  zero.  In  practice  it  is  desirable  to 
make”  the  step  size  as  large  as  possible  for  the  desired  degree  of  accuracy.  Therefore,  the 
range  of  h  for  Much  a  method  is  stable  is  important,  which  is  a  different  issue  than 
convergence,  and  thus  has  to  be  studied  separately.  Another  issue  of  importance  is  that  die 
presence  of  complex  conjugates  in  the  spectrum  of  M(h)  may  indicate  an  oscillatory 
component  in  the  computed  solution  that  may  not  be  present  in  the  exact  solution.  These 
issues  will  be  examined  now  by  using  a  simple  test  circuit  shown  in  Figure  3.1  to  generate  a 
system  of  equations  and  the  corresponding  companion  matrices. 

3.3  Th«  Companion  Matrices 

Applying  nodal  analysis  to  the  circuit  of  Figure  3.1,  we  obtain  the  matrix  equation: 


«l+«l  -«3 


]hl  +  *,-!>]  hi ,  o 


III 


Ci+hgu  -Cj+hgj,  M  _  c*  -c,  V! 
-ej+hg,,  c,+hg„]  [vjJb  [-C,  Cj]  [v,]^ 


0 


(3.19c) 


At  this  point  the  derivation  for  the  companion  matrices  of  a  full  matrix  solution,  the 
standard  Gauss-Seidel  and  the  predictor  Gauss-Seidel  methods  differ.  For  a  full  matrix 
solution  this  becomes 


where 


ci+hgn  -Cj+hgu]"1  [c,  -Cjl  fv, 

-Cj+hgji  Cj+hgjjJ  [-Cj  CjJ  [v,Jt  i 


.  J_  fkn  U  k 

Ak  kn  kji  vj  ,_j 


ku  »  c»  (cj+hgM)  -  Cj(c,-hgx,) 
ku  *  -c,(c,+hg„)  +  c,(cj-hgu) 
kji  -  ^(Cj-bgn)  -  c,(cj+hgn) 
kja  -  -c,(c,-hgjX)  +  ^(cj+hgjj) 

Ak  -  (Cj+hgjjJfCj+hgjj)  -  (-Cj+hgJC-Cj+hg,,) 


(3.20a) 

(3.20b) 

(3.21) 

(3.22) 

(3.23) 

(3.24) 

(3.25) 


3.3.  J  Standard  Gauss-Seidel 

When  applying  the  standard  Gauss-Seidel  technique,  Equation  (3.19)  becomes 


ci+bgu  0  1  [Vll 

«  »  • 

Cl  -c,  V, 

[0  -Cj+hgu 

Vj 

“Cj+hgjj  Cj+hg2jJ  [v,J  “ 

-C>  Cj  V* 

© 

o 

VJ 

(3.26a) 
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Solving  for  VM  and  V2^, 


1  ®lt  Bu 

A.  I®ji  ®jj 


*J.-t 


(3.27) 


where 


«n  -  Ci(Ci+«*g2J) 

(3.28) 

®u  “  -hgufe+hg^) 

(3.29) 

®ai  “  ci(cs*"kg2l)— c^Cj+hg^) 

(3.30) 

-  -Cj(Cj-hg21)  +  c^Cj+hgJ 

(3.31) 

A.  -  “Cj(cj-hg2l)  +  Cj(Cl+hgu) 

(3.32) 

The  spectrum  of  M^jCh)  can  then  be  found  as  the  roots  of  the  equation, 

“  (Hi  +  ®mVA.)X  +  (HamjiVA’)  -  0 

3.3.2  Predictor  Gauss-Seidel 


(3-33) 


In  Reference  [10]  Wei  introduced  a  modified  Gauss-Seidel  method  whidt  used  a  forward 
predictor  for  unsolved  node  voltages.  The  voltages  were  predicted  according  to  the  formula, 


(  V*-1  —  V,_l  1 

»  vr »  +  h.  [  *  h— ‘  j  (3.34) 

For  a  more  general  formula  which  generates  a  family  of  equations  and  includes  the 
original  predictor  method  as  a  special  case,  the  following  predictor  equation  is  proposed 


(3.34) 


V»  m  V*~l  4.  — 

)  Vf*  +  l+7h. 


(3.35) 
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This  family  of  equations  obviously  reduces  to  Equation  (3.34)  when  gamma  equals  zero, 
but  approaches  the  standard  Gauss-Seidel  method  as  gamma  goes  to  infinity.  Letting 


9  -  '3'36> 
applying  Equation  (3.35)  to  Equation  (3.19),  and  assuming  that  h,  =  h,_j  *=  h,  we  obtain 

Lvfr  L.J  fcl  =  fC‘  'h8“  +  fcl  (3.37) 


Ci+hgu  0  v 

"Cj+hgjj  Cj+hgjj  v. 


i  _  k  “bgu  +  etcj-hgjl  k 

’J.  L_c3  *  M.-i 


+  0-e(Cj-hgu)  k 


0  V. 


2J.-J 


‘n+hgn  0  0  Cj  -hgu+e(Cj- 

-c,+hgjl  Cj+hgjj  0  -  -c,  Cj 

0  0  1  K-t]  0  1 

.  J 

The  predictor  companion  matrix,  MPOS(h)  is  found  as 


u  1  k  -bgu+e^-bgu)  “0(c3~hgu)  r 


0  v. 


(3.38) 


Cl+fagu 

0 

0 

-i 

-bgu+«(cj-bgu)  -e(cj-hgu) 

-c,+hg21 

Cj+hg„  0 

-c,  C,  0 

0 

0 

ij 

0  1  0 

(3.39) 


P  Q  R 
S  T  U 
.0  1  0 


where 
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P  -  Cl/px  (3.40) 

Q  »  (-hgu  +  «(c,-hgu)y0i  (3-41) 

R  -  (-  0(cJ-hgll)V3l  (3.42) 

S  -  ^(Cj-hgjjVO^,)  -  Cj/p,  (3.43) 

T  -  (c,~hgai)(-hgia  +  efCj-hguJVO^j)  +  Cj/pj  (3.44) 

U  =  (-e(ca-hgai)(ca-hgia))/(310a)  (3.45) 

3i  «  (Ct+hgu)  (3.46) 

3,  -  (Ci+fcg21)  (3.47) 

The  spectrum  of  MPCS(h)  is  then  found  as  the  roots  to  the  cubic  equation, 

X1  -  (P+T)XJ  +  (PT-SQ-U)X  +  (PU-SR)  *  0  (3.48) 

Given  a  cubic  equation,  an  explicit  solution  in  terms  of  the  coefficients  is  found  as 

follows[13].  A  cubic  equation  of  the  form 

y3  +  py*  +  qy  +  r  -  0 


may  be  reduced  to  the  form 

x3  +  ax  +  b  =  0 

by  substituting  (x  •  p/3)  for  y.  When  this  is  done,  then 

a  «  (3q  -  p*)/3 
and 

b  =  (2p3  -  9pq  +  27r)/27 


For  solution  let 


A  -  ^ (-b/2)  +  V(b*/4  +  aV27) 


and 


B  =  ^  (— h/2)  -  V(bJ/4  +  aJ/37) 

Finally,  the  values  of  x  will  be  given  by 

x  «  A+B,  -(A+BV2  +  ((A-BV2)  V^3,  -  (A+B)/2  -  ((a-B)/2)  VC3 

When  the  above  formulas  are  applied  to  Equation  (3.48)  the  expressions  obtained  are 
complicated  enough  that  the  various  numerical  properties  of  the  method  are  probably 
masked.  Instead,  the  numerical  properties  of  the  methods  when  applied  to  the  test  circuit  of 
Figure  3.1  with  different  parameter  values  are  investigated. 

3.4  Stability  and  Acenracr  Stndtes 

In  this  section  several  studies  of  the  circuit  given  in  Figure  3.1  will  be  presented.  The 
component  values  in  each  case  are  selected  so  that  the  circuit  is  stable  with  strictly  real 
eigenvalues  and  then  the  spectrum  of  each  companion  matrix  as  a  function  of  the  step  size  is 
plotted.  Each  plot  will  have  a  solid  ellipse  representing  the  upper  half  of  the  unit  circle  and  a 
series  of  discrete  points  that  give  the  spectrum  at  specific  values  of  h.  The  values  of  the 
components  will  not  be  realistic  for  practical  circuits,  but  by  proper  scaling  these  results 
should  be  applicable  to  a  wide  variety  of  cases.  It  can  be  seen  from  Equation  (3.19)  that  if 
each  capacitance  and  the  step  size  h  are  multiplied  by  the  same  factor  o1(  then  the  results 
should  remain  unchanged.  Similarly,  if  each  conductance  is  multiplied  by  a  factor  a2,  and  the 


step  size  is  divided  by  a2  men  once  again  the  results  should  be  the  same.  In  summary,  if  the 
system 


Ci  +  Gx-0 


(3.49) 


results  in  the  companion  matrix  M(h),  then  the  system 

a,Ci+BjGx»0 


(3.50) 


a. 

will  give  the  companion  matrix  M( —  h). 

The  accuracy  studies  will  be  conducted  by  exciting  node  1  of  the  circuit  with  a  current 
source  of  1.0  amp  for  all  time  t  >  0.0  (with  0.0  amps  as  the  input  prior  to  t  =  0.0).  This 
step  input  will  allow  the  response  of  the  methods  to  a  rapidly  changing  input  to  be  compared 
by  observing  the  voltage  computed  for  node  2. 

3.4.1  Study  A 

By  letting  dj  *  dj  *  dj  *  1  (farad)  and  gi=gi=gi:=gm3xgrzxl  (mho)  this  study  provides  a 
reference  to  which  other  studies  can  easily  be  compared.  For  this  circuit  with  eigenvalues 
•2/3  and  -2,  Figure  3.2  shows  that  both  the  standard  and  predictor  Gauss-Seidel  methods  are 
stable  for  0  <  h  <  ®.  The  standard  method,  shown  in  Figure  3.2a,  is  strictly  real  for  all  h. 
The  spectrum  of  the  pure  predictor  method,  (i.e.,  gamma  *  0.0  in  Equation  3.36)  shown  in 
Figure  3.2b  has  complex  components  for  h  greater  than  one*third  and  reaching  a  maximum  at 
about  h  =*  1.  Finally,  the  modified  predictor  spectrum  is  shown  in  Figure  3.2c  with 
gamma  *  1.0.  As  expected,  this  spectrum  appears  to  be  a  weighted  average  of  the  previous 


Figures  3.3  and  3.4  are  plots  of  the  voltage  at  node  2  for  h  **  1.0  and  h  =  0.1 
respectively.  Comparing  the  standard  and  predictor  Gauss-Seidel  (gamma  =  0.0)  methods 
with  the  solution  generated  by  a  full  matrix  approach,  we  see  in  this  case  that  all  three 
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Standard  Gauss-Seidel  Spectrum 
for  Study  A. 


Figure  3.2b  Predictor  Gauss-Seidel  Spectrum 
with  gamma  =  0.0  for  Study  A. 


at  atari*  9.00000  lambda  1  -  -0.60607 

at  atop*  13.13200  lambda  2  -  -2.00000 


Seidel  Spectrum 
1.0  for  Study  A. 
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methods  have  comparable  accuracy  for  a  given  step  size.  Comparing  the  solutions  generated 
by  the  two  different  step  sizes,  die  larger  step  size  does  not  allow  the  voltage  to  change  as 
quickly  as  the  smaller  step  size.  Sharp  corners  and  the  lagging  response  in  Figure  3.3 
indicate  that  a  step  size  (h  **  1.0)  of  the  same  order  of  magnitude  as  the  eigenvalues  (-2/3 
and  *2)  may  result  in  an  inaccurate  solution  when  there  is  a  rapidly  changing  input. 

3.4.2  Study  B 

In  this  example,  6,  is  reduced  by  a  factor  of  ten.  This  has  die  effect  of  decreasing  the 
stiffness  of  the  circuit  (i.e.,  ratio  of  the  eigenvalues)  and  allows  the  spectrum  of  the  standard 
method,  Figure  3.5a  and  that  of  the  predictor  method  (gamma  *  0.0),  Figure  3.5b,  to  be 
strictly  real  and  within  the  unit  circle.  This  indicates  neither  method  should  have  any 
problems  with  instability  or  unwanted  oscillations  for  any  value  of  h. 

3.4.3  Study  C 

By  increasing  the  feedback  capacitance  fi,  to  100,  a  very  stiff  system  results.  Four 
different  spectra  are  plotted  in  Figure  3.6.  While  both  methods  (Figures  3.6a  and  3.6b)  still 
remain  stable  for  all  h,  the  pure  predictor  method  (gamma  *  0.0)  has  a  complex  spectrum 
for  h  >  2.8.  Increasing  gamma  to  1.5  completely  removes  the  complex  components  of  the 
predictor  method  (Figure  3.6c).  The  magnitude  of  the  complex  component  reaches  a 
maximum  at  h  *  25.0  and  in  this  case  it  is  seen  in  Figure  3.7  that  the  pure  predictor  method 
(i.e.,  gamma  **  0.0)  has  an  oscillatory  component  in  the  solution  that  is  not  present  using  the 
other  methods.  As  expected,  increasing  gamma  to  1.5  will  remove  these  oscillations. 
However,  it  was  found  experimentally  that  a  gamma  as  small  as  0.05  was  enough  to  remove 
the  oscillations  and  allowed  a  more  accurate  solution  (Figure  3.8).  The  root  locus  plot  for 
gamma  **  0.05  (Figure  3.6d)  shows  that  in  this  case  the  maximum  magnitude  of  the  complex 
part  of  the  roots  (now  shifted  to  h  *  12.5)  was  reduced  by  approximately  40  percent,  from 
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Figure  3.6a  Standard  GauM-Seidel  Spectrum 
for  Study  C. 


Predictor 


Predictor  Gauss-Seidel  Spectrui 
with  gamma  =1.5  for  Study  C 


Figure  3.6d  Predictor  Gau$s*Seidcl  Spectrum 
with  gamma  =  0.0S  for  Study  C. 


Figure  3.7  Node  2  output  college  with  >tep  size 

h  “  23.0  end  gunmt  =  0.0  for  Study  C. 


with  the  solution  generated  by  a  full  matrix  approach,  we  see  in  this  case  that  all  three 


Figure  3.8  Node  2  output  voltage  with  step  size 


Since  the  eigenvalues  of  the  test  circuit  are  strictly  real  (-0.1  and  -2.0),  the  exact  solution 
will  not  have  any  oscillatory  components.  Therefore,  any  oscillations  in  a  computed  solution 
are  due  to  the  numerical  technique  and  not  the  physical  circuit.  By  carefully  choosing  the 
step  size,  unwanted  numerically  generated  oscillations  can  be  observed.  However,  it  should 
be  noted  that  the  step  size  necessary  was  much  larger  than  either  of  the  eigenvalues,  and 
study  A  has  already  demonstrated  that  a  step  size  smaller  than  the  eigenvalues  can  be 
desirable  for  accuracy  when  an  input  to  the  circuit  changes  rapidly.  This  study  also  clearly 
shows  that  the  existence  of  complex  roots  in  the  spectrum  of  the  companion  matrix  is  not 
enough  to  indicate  oscillations  are  present  in  the  computed  solution.  Since,  in  general,  a  step 
size  larger  than  any  of  the  eigenvalues  may  not  generate  an  accurate  solution  in  response  to  a 
rapidly  changing  input,  this  indicates  that  these  unwanted  oscillations  are  probably  not  a 
major  factor  in  generating  accurate  solutions. 

3.4.4  Study  D 

In  this  case,  with  a  very  large  feedback  term  represented  by  g,,  *=  100,  we  find  that  both 
the  standard  Gauss-Seidel  (Figure  3.9a)  and  die  predictor  Gauss-Seidel  (gamma  =  0.0) 
(Figure  3.9b)  methods  become  unstable  for  h  within  a  finite  interval.  For  this  very  stiff 
system  with  eigenvalues  of  -0.037  and  -35.629,  the  solution  would  normally  be  obtained  by 
starting  with  a  very  small  step  size  to  catch  initial  transients  and  then  increasing  the  step  size 
so  that  the  rest  of  the  interval  would  not  require  too  much  computation  time.  However,  the 
standard  method  is  unstable  for  1  <  h  <  16  while  the  pure  predictor  method  is  unstable  for 
1  <  h  <  81.  If  an  automatic  time  step  control  scheme  increased  the  step  size  too  much  (i.e., 
h  >  1.0),  then  neither  method  would  be  satisfactory. 

For  completeness,  the  spectrum  of  the  predictor  method  with  gamma  =  2.0  is  included  as 
Figure  3.9c  to  demonstrate  that  the  modified  predictor  method  has  properties  in  between 
those  of  the  standard  and  the  pure  predictor  methods. 


Figure  3.9c  Predictor  GausS'Seide)  Spectrum 
with  gamma  =  2.0  for  Study  D. 
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3.4.5  Study  E 

This  case  demonstrates  properties  of  the  methods  when  a  very  large  transconductance  is 
present.  A  simple  exchange  of  the  values  for  gp  and  g.  yields  a  circuit  with  exactly  the  same 
eigenvalues  as  in  study  D.  While  the  predictor  method  with  gamma  =  0.0  (Figure  3.10a) 
still  has  a  finite  interval  where  it  is  unstable,  die  standard  Gauss-Seidel  method 
(Figure  3.10b)  is  stable  for  all  positive  h.  Comparing  the  C  and  G  matrices  for  this  test  and 
for  the  previous  test,  we  find  that  the  only  real  difference  is  that  the  one  element  has  moved 
to  form  a  lower  triangular  instead  of  an  upper  triangular  G  matrix.  To  make  the  modified 
predictor  method  stable  for  all  h,  gamma  is  set  to  5.0,  as  in  Figure  3.10s. 

An  accuracy  test  with  h  **  81  (Figure  3.11)  shows  that  die  pure  predictor  method  can 
oscillate  around  the  full  matrix  solution  when  the  step  size  is  much  larger  than  any  of  the 
eigenvalues.  In  order  to  use  the  predictor  method  with  step  size  h  *  1.0,  gamma  must  be 
nonzero.  If  gamma  =  5.0  as  in  the  spectrum  plot  of  Figure  3.10c,  the  accuracy  plot  of 
Figure  3.12  results.  In  this  case  (h  =  1.0),  neither  the  standard  or  the  predictor  Gauss-Seidel 
(gamma  =  5.0)  methods  are  very  close  to  the  full  matrix  approach.  If  the  step  size  is 
reduced  to  0.1,  as  in  Figure  3.13,  the  predictor  method  comes  much  closer  to  the  full  matrix 
solution,  while  the  standard  Gauss-Seidel  method  continues  to  differ  greatly.  If,  in  addition 
to  reducing  die  step  size  to  0.1,  gamma  is  reduced  from  5.0  to  0.0,  then  the  pure  predictor 
method  is  found  to  follow  the  full  matrix  solution  almost  exactly  (Figure  3.14). 

3.4.6  Studies  F  and  G 

Both  of  these  cases  lack  the  feedback  capacitor,  and  have  die  same  conductance 
matrix,  G.  The  only  difference  is  an  interchange  of  the  input  and  output  capacitances, 
fij  and  £j.  This  results  in  both  circuits  having  identical  eigenvalues  and  the  major  difference 
being  a  scale  factor  for  each  row  of  the  A  matrix.  A  comparison  of  the  spectrum  plots  for 
study  F  (Figure  3.15)  and  for  study  G  (Figure  3.16)  shows  that  the  results  are  identical  when 


Figure  3.10b  Predictor  Giuss-Seidel  Spectrum 
with  gamma  =  0.0  for  Study  E. 


Predictor  Gauss -Seidel  Spectrum 
with  gamma  =  5.0  for  Study  It. 


Figure  3.11  Node  2  output  voltage  with  step  lize 

h  =  81.0  and  gamma  =  0.0  for  Study  E. 
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Figure  3.13  Node  2  output  voltage  with  step  size 

h  =  0.1  and  gamma  =  5.0  for  Study  li. 


Figure  3.14  Node  2  output  voltage  with  Hep  size 

h  =  0.1  and  gamma  =  0.0  for  Study  E. 


Figure  3.15«  Standard  Gauss -Seidel  Spectrum 
for  Study  F. 


Figure  3.15b  Predictor  Gouss-Seidel  Spectrum 
with  gamma  =  0.0  for  Study  F. 


Figure  3.15c  Predictor  Gnuss-Seidel  Spectrum 
with  gamma  =  0.25  for  Study  F. 
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Figure  3.16a  Standard  Gouss-Seidel  Spectrum 
(or  Study  G. 


Figure  3.16b  Predictor  Gauss-Seidel  Spectrum 
with  gnmrui  =  0.0  for  Study  G. 
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gure  3.16c  Predictor  Gauss-Seidel  Spectra 
with  gamma  =  0.9  for  Study  G 


a  specific  method  is  compared  to  the  same  method.  One  interesting  result  of  these  studies  is 
that  even  though  both  the  capacitance  and  conductance  matrices  are  diagonally  dominant 
(i.e.,  for  each  row,  the  sum  of  the  off-diagonal  elements  is  less  than  the  absolute  value  of  the 
diagonal  element  [14]),  the  predictor  Gauss-Seidel  method  is  unstable  for  all  h  >  64.0,  while 
the  standard  Gauss-Seidel  method  is  stable  for  all  h.  An  accuracy  plot  (Figure  3.17)  shows 
that  for  h  comparable  to  the  eigenvalues,  both  the  standard  and  predictor  Gauss-Seidel 
methods  have  accuracy  very  close  to  that  of  a  full  matrix  solution. 

The  studies  conducted  so  far  have  been  chosen  to  illustrate  behavior  with  respect  to 
specific  component  values  in  the  test  circuit.  The  next  three  studies  are  included  to  allow 
further  comparison  based  upon  the  structure  of  the  capacitance  and  conductance  matrices. 

3.4.7  Studies  H.  J.  and  K 

Study  H  demonstrates  behavior  when  all  the  elements  in  both  the  capacitance  and 
conductance  matrices  are  nonzero.  As  in  study  F,  both  the  C  and  G  matrices  are  diagonally 
dominant.  A  wider  range  of  h  for  which  the  spectra  are  complex  is  observed  in 
Figures  3.18a  and  3.18b.  The  predictor  method  still  is  unstable  for  large  enough  h,  while  the 
standard  method  is  stable  for  all  h. 

Study  J  has  the  same  C  and  G  matrices  as  study  H,  except  that  the  upper  hand  element  of 
the  conductance  matrix  has  been  set  to  zero.  This  creates  a  lower  triangular,  diagonally 
dominant  G  matrix.  In  this  example,  Figures  3.19a  and  3.19b  show  that  both  methods  are 
stable  for  all  h. 

Study  K  again  has  the  same  C  and  G  matrices  as  study  H,  except  that  this  time  the  lower 
lefthand  element  has  been  set  to  zero.  This  yields  an  upper  triangular  conductance  matrix. 
As  in  study  J,  both  the  standard  and  predictor  Gauss-Seidel  (gamma  ■  0.0)  methods  are 
stable  for  all  h.  (Figure  3.20.) 


Time  (second*) 

Figure  3  17  Node  2  output  voltage  with  step  size 

h  3  1.0  and  gamma  -  0.0  for  Study  F. 
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Figure  3.18a  Stiinilnrd  Gnuss-Scidcl  Spectrum 


Figure  3.19a  Standard  Gnuss-Scidcl  Spectrum 


iricl  Spectrum 
for  Study  J. 


Figure  3.20a  Standard  Gnuss-Scidcl  Spectrum 
for  Study  K. 
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The  results  of  each  of  these  tea  studies  are  summarized  in  Figure  3.21.  Included  for  each 
study  are  die  capacitance  (C),  conductance  (G),  and  A  =  (-C_1G)  matrices,  along  with  the 
eigenvalues  of  the  test  circuit.  In  the  following  discussion,  the  term  "standard  method" 
means  "standard  Gauss-Seidel  method"  and  the  term  "predictor  method"  meant  "predictor 
Gauss-Seidel  method.” 

A  comparison  of  studies  A,  B,  and  C  shows  that  the  differences  result  from  the  difference 
in  the  capacitance  matrix.  By  referring  to  Equation  (3.6),  it  can  be  seen  that  the  C  matrix 
will  always  be  symmetric  (i.e.,  C  »  C1)^]  and  will  be  diagonally  dominant  as  long  as 
and  fij  are  both  nonzero.  This  indicates  that  if  the  G  matrix  is  strictly  diagonal,  both  the 
standard  and  predictor  methods  will  be  stable  for  all  h.  If  the  feedback  capacitance,  e,,  is 
very  large  in  comparison  to  the  other  two  capacitance  then  some  undesirable  oscillation  can 
occur,  but  stability  is  maintained. 

The  next  two  studies,  D  and  E,  are  the  only  two  studies  with  conductance  matrices  that 
are  not  diagonally  dominant.  They  are  also  the  only  two  studies  with  a  finite  range  of  h 
where  the  predictor  method  is  unstable.  In  every  other  study,  if  the  predictor  method  is 
stable  for  some  ha,  then  the  method  is  stable  for  all  h  <  ha.  This  stability  property  is  also 
true  for  the  standard  method  in  study  D.  (The  standard  method  is  stable  for  all  h  in  all 
studies  except  D.)  Since  the  interval  of  h  for  which  the  methods  are  unstable  includes  a 
region  between  the  value  of  the  eigenvalues,  it  is  likely  that  a  fast  and  accurate  solution 
would  be  difficult  to  achieve  for  study  D  with  either  method.  (This  is  because  a  step  size  too 
large  would  either  be  in  the  unstable  region  or  could  cause  large  inaccuracies.  But  a  small 
step  size  slows  the  speed  of  analysis.) 
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Figure  3.21  Comparison  of  capacitance,  conductance 
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Study  E,  where  the  G  matrix  is  lower  diagonal,  results  in  the  standard  method  being 
stable  for  all  h,  while  the  predictor  method  is  still  unstable  for  a  finite  range  of  h  that  lies 
between  the  value  of  the  eigenvalues.  This  is  an  example  where  the  standard  method  seems 
more  desirable  because  of  the  stability  properties.  However,  the  accuracy  plots  show  that  for 
h  where  the  predictor  method  is  stable,  the  predictor  method  can  also  be  much  more 
accurate.  When  possible,  Figure  3.14  indicates  that  the  most  desirable  technique  would  be  to 
use  the  predictor  method  where  it  is  stable  with  gamma  equal  to  zero.  If  this  step  size  is  too 
«n»li  for  the  desired  speed  of  analysis,  then  the  smallest  gamma  possible  that  insures  stability 
would  be  needed  for  greatest  accuracy. 

The  last  five  studies  all  have  a  similar  diagonally  dominant  conductance  matrix.  The  first 
two  of  these  five,  studies  F  and  G,  have  a  strictly  diagonal  capacitance  matrix  which  result  in 
each  row  of  the  A  matrix  being  scaled,  with  no  effect  on  die  stability.  The  last  two,  studies  I 
and  K,  have  one  element  in  the  conductance  matrix  zeroed  so  that  the  conductance  matrix 
becomes  triangular.  This  seems  to  increase  die  range  of  h  for  which  the  methods  are  stable. 

However,  making  an  off-diagonal  element  zero  should  allow  the  matrices  to  behave  more 
like  diagonal  matrices,  and  the  first  three  studies,  A,  B,  and  C,  have  already  demonstrated 
the  behavior  of  a  strictly  diagonally  G  matrix. 

In  summary,  both  methods  had  either  stability  or  accuracy  problems  if  the  conductance 
it  strix  was  not  diagonally  dominant.  In  these  studies,  if  the  conductance  matrix  is  diagonally 
dominant,  then  the  standard  Gauss-Seidel  method  is  stable  for  all  h,  but  a  dominant 
conductance  matrix  is  not  sufficient  to  ensure  that  the  predictor  Gauss-Seidel  method  is  stable 
for  all  h.  In  each  case,  however,  if  the  method  is  stable  for  some  h,,  then  it  is  also  stable  for 


all  h  <  h,.  Therefore,  •  proper  choice  of  gamma  should  be  possible  to  make  the  predictor 
method  as  stable  as  the  standard  method  but  with  increased  accuracy  for  the  same  step  size. 
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CHAPTER  4 
PREMOS 


P  REM  OS  (PREdiction  Based  simulator  for  MOS  circuits)  is  an  experimental  circuit 
analysis  program  that  is  an  attempt  to  bridge  the  gap  between  conventional  circuit  simulation 
and  conventional  logic  simulation.  PREMOS  was  developed  by  Wei  at  die  University  of 
Illinois  and  it  evolved  directly  from  MOTIS-C;  but  a  number  of  basic  changes  were 
implemented  in  an  attempt  to  increase  the  speed  and  accuracy.  In  addition  to  the  predictor 
method  described  in  the  previous  chapter,  other  changes  include  new  data  structures  so  that 
new  analysis  algorithms,  such  as  analysis  sequencing,  could  be  implemented. 


The  subcircuit  models  built  into  the  program  are  a  major  factor  why  PREMOS  is  faster 
than  conventional  circuit  simulators  such  as  SFICE2.  However,  if  a  circuit  cannot  be 
constructed  from  the  built-in  models,  then  PREMOS  cannot  be  used  for  simulations.  Thus,  a 
user-specified  subcircuit  description,  as  well  as  an  automatic  partitioning  algorithm,  are 
needed  to  make  the  program  more  useful.  Originally  PREMOS  was  restricted  to  NMOS 
circuit  simulations,  but  as  part  of  the  work  reported  in  this  diesis,  several  new  subcircuits 
have  been  added  to  allow  a  limited  CMOS  circuit  capability.  These  new  subcircuits  as  well  as 
how  to  use  PREMOS  are  described  in  the  following  sections. 


4.2  Iaoat  Circuit  Dgggtt 


The  program  PREMOS  supplies  the  user  with  a  variety  of  subcircuit  models,  henceforth 
referred  to  as  primitives,  that  consist  of  a  specific  interconnection  of  transistors.  The 
parameters  (such  as  width  to  length  ratio  of  the  transistors  and  node  capacitances)  for  each 
type  of  subcircuit  are  specified  using  the  "model"  card.  The  model  card  is  also  used  to  give  a 


name  to  each  type  of  subcircuit,  to  that  each  primitive  can  be  used  repeatedly  with  a  different 
set  of  parameters  each  time.  The  general  format  of  the  model  card  is 

MODEL  (modname)  (primitive)  (parameters) 

After  each  necessary  type  of  subcircuit  has  been  described  and  named,  the  interconnection 
of  the  subcircuits  is  specified.  In  this  section  of  the  input  the  user  gives  each  subcircuit  a 
distinct  name,  specifies  to  which  nodes  the  subcircuit  is  connected,  and  gives  the  model  name 
specifying  the  proper  type  of  subcircuit.  The  general  format  of  the  interconnect  card  is 

(name)  (nodes)  (modname) 

with  one  exception.  This  exception  is  when  a  subcircuit  uses  the  SOURC  primitive. 

4.2.1  The  SOURC  Primitive 

The  SOURC  primitive  is  unique  because  it  is  used  to  describe  voltage  sources  that  switch 
between  two  levels,  such  as  clock  pulses,  rather  than  an  interconnection  of  transistors.  The 
pulses  are  specified  by  a  series  of  Ts"  and  "O’s”  following  the  model  name  on  the 
interconnect  card  to  indicate  when  the  source  is  at  the  high  or  low  voltage  level,  respectively. 
Linear  interpolation  is  used  to  determine  the  voltage  level  during  the  rise  and  fall  times  when 
switching  between  levels.  An  example  of  this  switching  behavior  is  given  in  Figure  4.1.  As 
long  as  t^  +  t^  is  the  same  as  tM(k  +  t^,  (specified  with  the  MODEL  card),  then  the  user 
should  have  no  trouble  as  two  consecutive  digits  will  always  have  the  same  total  duration. 
However,  if  the  above  condition  does  not  hold,  then  care  should  be  taken. 

4.2.2  The  CMOS  Cells 

Four  new  CMOS  primitives  have  been  added  to  PREMOS.  These  are  an  inverter 


(Figure  4.2),  a  2-input  NAND  (Figure  4.3),  a  2-input  NOR  (Figure  4.4),  and  a  variable 
number  of  transfer  gates  in  series  (Figure  4.5).  The  notation  that  follows  is 
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win  »  width-to-length  ratio  of  the  a -channel  trassiston 
wlp  -  width-to-length  ratio  of  the  p-chaasel  transistors 
eg  ■  gate  (input)  node  capacitance 
d  *  internal  node  capacitance 
d  *  load  (output)  node  capatitance 

and,  additionally,  for  the  transfer  gates 
cs  *  source  (input)  node  capacitance 
cgn  *  gate  of  n-channel  transistor  capacitance 
cgp  a  gate  of  p-channel  transistor  capacitance 
ct  »  intertransfer  gate  transistor  capacitance 
nt  a  number  of  transfer  gates 

The  parameters  for  the  primitives  are  then  specified  in  the  following  order: 

CMINV  win  wlp  eg  cl 
CMNOR  win  wlp  eg  ci  d 
CMNAN  win  wlp  eg  a  d 
CMTFR  win  wlp  a  cgn  cgp  ct  d  nt 

To  specify  circuit  connections,  the  user  must  assign  node  numbers  for  each  of  these  cells 
in  the  order: 

CMINV  ngnl 

CMNOR  nl  n2  ni  nl 

CMNAN  nl  n2  ni  nl 

CMTFR  ns  nnl  npl  nil  nn2  np2  .  .  .  al 

The  first-time  user  should  note  that  even  internal  node  numbers  for  each  cell  must  be 
specified  even  though  internal  nodes  should  never  be  connected  to  another  cell. 


■> 
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To  allow  greater  flexibility,  the  CMOS  transfer  gates  have  not  been  specifically  linked  to 
other  type  of  logic  as  was  done  by  Wei  for  NMOS  transfer  gates.  However,  large 
inaccuracies  may  result  if  the  signal  flow  within  the  circuit  is  from  the  output  node  to  the 
source  node  of  a  series  of  transfer  gates  instead  of  from  the  source  to  the  output.  Therefore, 
the  user  is  encouraged  to  connect  the  output  node  of  transfer  gates  only  to  the  input  node  of 
other  cells.  It  should  also  be  noted  that  proper  operation  of  CMOS  transfer  gates  requires 
complementary  control  signals  to  the  n-channel  and  p-channel  transistors.  Therefore,  the  use 
may  need  to  add  an  inverter  so  that  both  polarities  of  the  signal  are  available. 

4.3  Control  Commindi 

The  control  commands  are  input  into  P  REM  OS  in  the  same  file  as  die  input  data 
description.  The  nine  basic  commands  are  described  in  Appendix  3  of  Reference  [10].  The 
only  changes  have  to  do  with  the  additional  CMOS  subdrcuits  and  the  modification  to  the 
predictor  Gauss-Seidel  method. 

The  OPT  card  is  used  to  specify  the  number  iterations  allowed  for  each  primitive  for  any 
single  point  in  time.  The  number  of  dc  iterations  fox  the  CMOS  primitives  can  either  be 
concatenated  to  the  string  given  in  Reference  [10],  or  if  a  zero  is  input  as  the  first  number, 
then  the  CMOS  dc  iterations  can  be  inputted  directly.  That  is, 

OPT  itnan  itnor  ittrs  itpul  hlch  itao  rtoa  itmos  itcin  itcor  itcan  item 
or, 

OPT  0  itcin  itcor  itcan  item 

where  itcin,  itcor,  itcan,  and  item  are  the  number  of  preset  dc  iteration  for  a  CMOS  inverter, 
a  CMOS  NOR  gate,  a  CMOS  NAND  gate,  and  a  CMOS  transfer  gate  respectively  and  all 
other  terms  are  as  defined  by  Wei.  The  default  number  of  iteration  for  any  primitive  not 
specified  is  one. 
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The  CONTL  card  has  simply  added  one  more  parameter,  gamma,  so  that  the  modified 
predictor  method  can  be  used.  If  the  predictor  scheme  is  chosen  then  gamma  may  be 
specified.  The  default  value  of  gamma  is  zero.  The  form  of  the  CONTL  card  is  now: 

CONTL  la  ten  hstp  Ipred  gamma 
with  laten,  hstp,  and  Ipred  as  defined  by  Wei. 

4.4  Tranditor  P«»erietloa 

In  addition  to  the  input  circuit  description  and  control  file  already  described,  PREMOS 
requires  a  second  data  file  describing  the  transistor  characteristics.  The  modeling  equations 
used  are  [15]  and  [16]: 

lM  -  KP  .  .  (tV.  -  Vt)  V,  -  I  V-,)  (4.1) 


for  operation  in  the  linear  region  and 


Ids 


KP  r  1+Wp, 

2  l+il  (VCI-VT) 


(V0.  »  VT)’ 


(4.2) 


for  operation  in  the  saturation  region  with 


VT 


v„  +  avt 


(4.3) 


and 

KP  »  intrinsic  transconductance 
X  *  channel  length  modulation  parameter 
y\  «  mobility  reduction  parameter 
Vt.  “  threshold  voltage  at  dc  bias 

AVt  ■  threshold  voltage  change  due  to  substrate  bias  voltage  change 


These  parameters  for  each  type  of  transistor  must  be  m  a  file  named  "datai.dat."  Each 
transistor  has  a  set  of  24  parameters,  the  first  four  of  which  are  KP,  X,  tj,  and  V-ro*  The  next 
20  are  AVT  as  represented  in  tabular  form  as  a  function  of  the  source  to  substrate  voltage. 
The  first  set  of  24  is  used  to  describe  all  of  the  NMOS  enhancement  mode  transistors.  The 
second  set  of  24  is  used  to  describe  the  NMOS  depletion  mode  transistors  and  should  have  a 
negative  v«.  The  third  group  of  24  is  for  the  PMOS  enhancement  mode  transistor  and 
should  also  have  a  negative  v„.  Finally,  the  last  group  of  24  is  reserved  for  PMOS 
depletion  mode  transistors,  even  though  no  primitives  have  been  implemented  yet  that  use 
this  type  of  transistor. 

4.5  Onttmt  Proewrina 


In  addition  to  being  able  to  generate  plots  on  standard  printer,  PREMOS  is  also  capable 
of  generating  output  suitable  for  input  to  a  graphics  plotter.  Currently,  the  SEND  command 
will  generate  an  output  data  file  named  "plfile.dat"  which  can  then  be  used  by  a  program 
named  "graf"  for  making  graphs  on  Tektronics  4010  series  terminals.  All  the  user  has  to  do 
is  execute  graf,  which  will  read  in  the  file  plfile.dat  and  then  prompt  the  user  for  all 
necessary  additional  input. 

4.4  SflMjg  Commands 

At  this  point  it  is  appropriate  to  demonstrate  the  use  of  PREMOS  to  analyze  a  particular 
circuit.  The  circuit  chosen  for  this  is  a  ring  oscillator  constructed  from  CMOS  gates  as  shown 
in  Figure  4.6.  A  file  named  "cring.dat"  describing  the  circuit  and  giving  the  control 
commands  for  this  circuit  is  given  m  Figure  4.7.  The  transistor  data  file  (always  named 
"datai.dat")  is  given  if  Figure  4.8.  At  the  time  of  this  writing,  PREMOS  has  been  developed 
in  conjunction  with  the  BSD  4.1  version  of  the  UNIX™  operating  system.  Given  that  the 
executable  code  for  PREMOS  is  contained  m  the  file  named  "prec.ext"  and  letting  bold  face 
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medal  cUc  soure(S  0  ion  Sn  lOn  Sr) 
xi  l  4  S  2  nmnd 
x2  2  3  inv 
x3  3  4  inv 

el  io  elk  oiooio 
opt  03333 
de 

eontl  ill 
tiaa  40n  in 
plot  123 
sand  123 
v*  5 
and 


Figure  4.7  Input  circuit  sad  control 
fils  for  CMOS  osdBator. 


2-0000-05  1.000d-03  1.000d-C3  1.0000*00  0.0000*00  0.0000-00 
o.oooo-oo  o.oooo-oo  o.oooo-oo  o.oooo-oo  o.oooo-oo  o.oooo*oo 

0.0000*00  0.0000*00  0.0000*00  0.0000*00  0. 000d*00  C. 000d*00 
0.0000*00  0.0000*00  0.0000*00  0.0000*00  0.0000*00  0.0000*00 
2.0000-05  1.0000-03  1.0000-03  -2.000000  0.0000*00  0.0000*00 

0.0000*00  0.0000*00  0.0000*00  0.0000*00  0.0000-00  o.oooo-oc 
0.0000-00  0.0000*00  0.0000*00  o.cooo-oc  C. 0000-00  o.oooo-oc 
0.0000*00  0.0000*00  0.0000*00  O.OOOO-OC  C.0000-00  c.ooco-cc 
2 . 0000r05  1.0000-03  1.0000-03  -1.000-00  C. 0000-00  O.OOOO-OC 

0.0000*00  0.0000*00  0.0000*00  0.0000*00  C. 0000-00  c.oooo-oc 
0.0000-00  0.0000*00  0.0000*00  0.0000-00  0.0000-00  0.0000-00 
0.0000*00  0.0000*00  0.0000*00  0.0000-00  0.0000*00  O.OOOO-OC 
2.0000-05  1.0000-03  1.0000-03  2.0000-00  0.0000-00  0.0000*00 

0.0000*00  0.0000*00  0.0000*00  0.0000-00  0.0000*00  o.oooo-oc 
0.0000*00  0.0000*00  0.0000*00  0.0000-00  c.oooo-oc  o.oooo-cc 
0.0000*00  0.0000*00  0.0000*00  0.0000*00  0.0000-00  0.0000*00 
0.0000*00  0.0000*00  0.0000-00  0.0000-00  0.0000-00  o.oooo-oc 


Figure  4.8  Transistor  process  parameter 
file  for  ring  oscillator. 


denote  what  the  user  needs  to  type  m  response  to  various  prompts,  then  a  typical  analysis  of 


a  circuit  would  proceed  as  follows: 


%  prec.ext  < return > 
circuit  file  name?  cring.dat  <return> 
listing  file  name?  cring.out  <retuxn> 
% 


At  this  time,  the  output  data  is  in  the  file  "cring.out"  and  can  be  listed  on  a  terminal  or 
printed  on  paper.  PREMOS  has  also  created  a  new  file  with  the  nwe  "plfile.dat"  for  use 
with  the  graphing  routines.  If  "plfile.dat”  already  existed,  h  is  overwritten  with  the  new 


output. 


The  plotting  routines  have  been  written  using  PLOTIO  software  for  use  with  Tektronics 


4010  series  terminals.  Assuming  the  user  is  now  logged  on  to  an  appropriate  graphics 


terminal,  the  session  could  proceed  as  follows: 


%  graf 

want  hard  copy?  yes  *  1,  no  *  0 
1  <  return  > 

1st  symbol  type-?  0(no)ll(0),2(x),5(*)f8(+),<ll 
1st  line  type-?  l-4(dash),0(solid),-l(no),<ll 
sequential  line  type:  0  1  2  3  4  23  34,  <11 
0  0  <return> 

2nd  symbol  type-  ?  2nd  line  type-  7 
0 1  <retum> 


After  the  user  has  selected  the  desired  style  of  line  for  of  up  to  seven  output  voltages 
(previously  selected  with  the  SEND  command  in  the  input  file),  the  waveforms  will  be 


plotted.  When  the  user  is  ready  to  continue  with  other  commands,  the  "return"  key  must  be 


pressed  to  obtain  another  prompt  from  the  computer.  As  can  be  seen  above,  a  number  of 


possible  plotting  styles  are  given  to  the  user  in  the  prompt  for  the  first  waveform.  For  other 


w'V.-JV ...V v*  vVv  O  *  *.*  v*  a.‘  v  - *_\  ■_  \  --  . 


possibilities,  refer  to  the  PLOTIO  manuals.  The  graphics  output  resulting  from  the  input  file 
"cring.dat”  is  shown  in  Figure  4.9.  Other  sample  circuits  are  include  in  the  appendix. 

4.7  DI*cn?«ioH 

PREMOS  is  a  fairly  easy  program  to  use  provided  the  circuit  can  be  described  with  the 
primitives  provided.  Furthermore,  after  learning  how  to  add  one  primitive,  it  is  fairly  easy 
to  add  new  primitives  as  the  need  arises.  However,  in  the  present  implementation  adding 
new  primitives  increases  the  size  of  the  program,  while  for  speed  it  is  desirable  to  keep  the 
program  small.  Since  the  primitives  are  used  mainly  in  identifying  the  topology  of  a  circuit, 
it  may  be  desirable  to  change  the  structure  of  the  program  so  that  the  topology  of  the  circuit 
is  stored  in  an  intermediate  data  file.  While  allowing  a  much  larger  selection  of  primitives  to 
be  maintained,  this  would  also  have  the  benefit  that  different  control  commands  inputs  could 
be  used  without  the  overhead  of  always  analyzing  the  basic  structure  of  the  circuit.  This 
could  be  valuable  if  the  predictor  method  was  ever  found  to  be  unstable  for  a  particular 


80 


CHAPTER  5 

Conclusions 


Performing  circuit  level  simulation  at  logic  level  speeds  is  an  active  area  of  research.  The 
methods  used  in  the  program  PREMOS  to  increase  the  speed  of  analysis  include  analysis 
sequencing,  latency  checking,  and  circuit  partitioning.  While  these  techniques  do  decrease 
the  amount  of  computer  time  required,  some  important  restrictions  must  be  placed  on  the 
type  of  circuits  allowed.  As  implemented  in  PREMOS,  one  major  requirement  is  that  a 
circuit  must  be  composed  of  available  unidirectional  subcircuits.  The  unidirectional 
requirement  means  that  feedback  within  a  circuit  requires  special  numerical  algorithms.  This 
work  has  attempted  to  show  that  the  predictor  Gauss-Seidcl  method  is  a  viable  approach. 

While  the  predictor  Gauss -Seidel  method  is  not  in  general  stable  for  all  positive  time  step 
sizes,  it  has  been  shown  to  be  stable  for  some  interval  of  step  sizes  in  each  of  seven  studies. 
Perhaps  just  as  important  is  the  comparison  between  the  predictor  and  the  (commonly  used) 
standard  Gauss-Seidel  method.  With  the  proper  choice  of  a  parameter,  the  predictor  method 
can  be  made  stable  over  as  wide  a  range  of  step  size  as  the  standard  method.  The  major 
drawback  with  this  is  that  there  is  no  way  to  determine  the  optimum  value  of  this  parameter 
at  present. 

PREMOS  is  a  program  that  is  accurate  enough  for  many  applications.  However,  for 
fiture  experimental  work  it  may  be  helpful  to  break  the  program  into  smaller,  interacting 
functional  blocks.  At  the  present  time  PREMOS  contains  over  4500  lines  of  code.  While 
this  is  not  nearly  as  large  a  program  as  SPICE,  continued  growth  could  certainly  slow  the 
speed  of  analysis.  Breaking  the  program  into  smaller  functional  blrcks  would  have  the 
disadvantage  that  more  information  would  have  to  be  stored  for  the  blocks  to  be  able  to 


cmos  RS  flip-flop 

model  nand  cmnan(2  1  1 5 f  lOf  50 f ) 

model  elk  soure(5  0  lOn  5n  lOn  5n) 

* 

ndl  1384  nand 
nd2  2395  nand 
nd3  4  7  10  6  nand 

nd4  5  6  11  7  nand 

* 

ss  10  elk  11001100001 

rr  20  elk  00111100101 

en  3  0  elk  11-111110101 
* 

opt  0  3  3  3  3 
dc 

contl  111 
time  120n  In 
send  1267 
v  5 

end 

* 


Figure  A.2  Input  Data  File  for 
Gated  R-S  Flip-Flop. 


cinos  master-slave  RS  flip-flop 
model  inv  cminv(2  2  lOf  20f) 
model  nand  emnan(2  1  1 5 f  lOf  50f) 

model  elk  soure(5  0  lOn  5n  lOn  5n) 

* 

xl  3  8  inv 

ndl  1  3  13  4  nand 

nd2  2  3  14  5  nand 

nd3  4  7  15  6  nand 

nd4  '5  6  16  7  nand 

nd5  6  8  17  9  nand 

nd6  7  8  18  10  nand 

nd7  9  12  19  11  nand 

nd8  10  11  20  12  nand 

* 

ss  1  0  elk  0011000001010 
rr  20  elk  0000110000101 

ck  30  elk  1010101010101 

* 

opt  0  3  3  3  3 
dc 

contl  111 
time  120n  In 
send  1  2  3  11  12 
v+  5 
end 


Figure  A.5  Input  Data  File  for  Master-Slave 
R-S  Flip-Flop. 
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cmos  binary-to-octal  decoder 
model  inv  cminv(2  2  10£  2 0 £ ) 
model  nand  cmnan(2  1  1 5 f  lOf  50 £ ) 
model  nor  emnord  2  15f  lOf  20£) 

model  elk  sourc(5  0  lOn  5n  lOn  5n) 

* 


xl  1  5  inv 
x2  2  6  inv 
x3  3  7  inv 


nrl  5  6  22  8  nor 
nr2  5  2  23  9  nor 
nr3  6  1  24  10  nor 
nr4  2  1  25  11  nor 
nr5  3  4  26  12  nor 
nr6  7  4  27  13  nor 


ndl  12  11  28  14  nand 
nd2  9  12  29  15  nand 
nd3  12  10  30  16  nand 
na4  8  12  31  17  nand 
ndS  11  13  32  18  nand 
nd6  9  13  33  19  nand 
nd7  10  13  34  20  nand 
nd8  8  13  35  21  nand 


ckl 

ck2 

ck3 

ck4 


1  0  elk  0  1  C  1 

2  0  elk  0  0  1  1 

3  0  elk'  0  0  0  0 

4  0  elk  0  0  0  0 


01010101 

00110011 

11000111 

OOliliCO 


opt  0  3  3  3  3 
dc 

contl  111 
time  60n  In 
send  14  15  16 
v+  5 
end 


Figure  A.8 


Input  Data  File  for 
Binary-to-Octal  Decoder. 
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